SEQUENCE RECORD 

(1) GENERAL INDICATIONS: 

(i) APPLICANT: 

(A) NAME: Deutsches Krebsforschungszentrum 

(B) STREET: Im Neuenheimer Feld 280 

(C) TOWN: Heidelberg 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 69120 

(ii) TITLE OF THE INVENTION: Multivalent Antibody Constructs 

(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER-READABLE VERSION: 

(A) DATA CARRIER: floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPA) 

(2) INDICATIONS AS TO SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 28..1689 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 28..1689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAATTCATTA AAGAGGAGAA ATTAACC ATG AAA TAC CTA TTG CCT ACG GCA 51 
Met Lys Tyr Leu Leu Pro Thr Ala 
1 5 

GCC GCT GGC TTG CTG CTG CTG GCA GCT CAG CCG GCC ATG GCG CAG GTG 99 
Ala Ala Gly Leu Leu Leu Leu Ala Ala Gln Pro Ala Met Ala Gln Val 
10 15 20 

CAA CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG 147 
Gln Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val 
25 30 35 40 

AAG ATG TCC TGC AAG GCT TCT GGC TAC ACC TTT ACT AGG TAC ACG ATG 195 
Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr Thr Met 
45 50 55 

CAC TGG GTA AAA CAG AGG CCT GGA CAG GGT CTG GAA TGG ATT GGA TAC 243 
His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile Gly Tyr 
60 65 70 

ATT AAT CCT AGC CGT GGT TAT ACT AAT TAC AAT CAG AAG TTC AAG GAC 291 
Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Phe Lys Asp 
75 80 85 

AAG GCC ACA TTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAA 339 
Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr Met Gln 
90 95 100 

CTG AGC AGC CTG ACA TCT GAG GAC TCT GCA GTC TAT TAC TGT GCA AGA 387 
Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 
105 110 115 120 

TAT TAT GAT GAT CAT TAC AGC CTT GAC TAC TGG GGC CAA GGC ACC ACT 435 
Tyr Tyr Asp Asp His Tyr Ser Leu Asp Tyr Trp Gly Gln Gly Thr Thr 
125 130 135 

CTC ACA GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 483 
Leu Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp Ile 
140 145 150 

TTG CTC ACC CAA ACT CCA GCT TCT TTG GCT GTG TCT CTA GGG CAG AGG 531 
Leu Leu Thr Gln Thr Pro Ala Ser Leu Ala Val Ser Leu Gly Gln Arg 
155 160 165 

GCC ACC ATC TCC TGC AAG GCC AGC CAA AGT GTT GAT TAT GAT GGT GAT 579 
Ala Thr Ile Ser Cys Lys Ala Ser Gln Ser Val Asp Tyr Asp Gly Asp 
170 175 180 

AGT TAT TTG AAC TGG TAC CAA CAG ATT CCA GGA CAG CCA CCC AAA CTC 627 
Ser Tyr Leu Asn Trp Tyr Gln Gln Ile Pro Gly Gln Pro Pro Lys Leu 
185 190 195 200 

CTC ATC TAT GAT GCA TCC AAT CTA GTT TCT GGG ATC CCA CCC AGG TTT 675 
Leu Ile Tyr Asp Ala Ser Asn Leu Val Ser Gly Ile Pro Pro Arg Phe 
205 210 215 

AGT GGC AGT GGG TCT GGG ACA GAC TTC ACC CTC AAC ATC CAT CCT GTG 723 
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Asn Ile His Pro Val 
220 225 230 



GAG AAG GTG GAT GCT GCA ACC TAT CAC TGT CAG CAA AGT ACT GAG GAT 771 
Glu Lys Val Asp Ala Ala Thr Tyr His Cys Gln Gln Ser Thr Glu Asp 
235 240 245 

CCG TGG ACG TTC GGT GGA GGC ACC AAG CTG GAA ATC AAA CGG GCT GAT 819 
Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys Arg Ala Asp 
250 255 260 

GCT GCG GCC GCT GGT GGT GGT GGT TCT GGC GGC GGT GGT AGC GGT GGT 867 
Ala Ala Ala Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
265 270 275 280 

GGC GGC TCC GGT GGT GGT GGT AGC CAG GTG CAG CTG CAG CAG TCT GGG 915 
Gly Gly Ser Gly Gly Gly Gly Ser Gln Val Gln Leu Gln Gln Ser Gly 
285 290 295 

GCT GAG CTG GTG AGG CCT GGG TCC TCA GTG AAG ATT TCC TGC AAG GCT 963 
Ala Glu Leu Val Arg Pro Gly Ser Ser Val Lys Ile Ser Cys Lys Ala 
300 305 310 

TCT GGC TAT GCA TTC AGT AGC TAC TGG ATG AAC TGG GTG AAG CAG AGG 1011 
Ser Gly Tyr Ala Phe Ser Ser Tyr Trp Met Asn Trp Val Lys Gln Arg 
315 320 325 

CCT GGA CAG GGT CTT GAG TGG ATT GGA CAG ATT TGG CCT GGA GAT GGT 1059 
Pro Gly Gln Gly Leu Glu Trp Ile Gly Gln Ile Trp Pro Gly Asp Gly 
330 335 340 

GAT ACT AAC TAC AAT GGA AAG TTC AAG GGT AAA GCC ACT CTG ACT GCA 1107 
Asp Thr Asn Tyr Asn Gly Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala 
345 350 355 360 

GAC GAA TCC TCC AGC ACA GCC TAC ATG CAA CTC AGC AGC CTA GCA TCT 1155 
Asp Glu Ser Ser Ser Thr Ala Tyr Met Gln Leu Ser Ser Leu Ala Ser 
365 370 375 

GAG GAC TCT GCG GTC TAT TTC TGT GCA AGA CGG GAG ACT ACG ACG GTA 1203 
Glu Asp Ser Ala Val Tyr Phe Cys Ala Arg Arg Glu Thr Thr Thr Val 
380 385 390 

GGC CGT TAT TAC TAT GCT ATG GAC TAC TGG GGT CAA GGA ACC TCA GTC 1251 
Gly Arg Tyr Tyr Tyr Ala Met Asp Tyr Trp Gly Gln Gly Thr Ser Val 
395 400 405 

ACC GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC GTG 1299 
Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp Ile Val 
410 415 420 

CTC ACT CAG TCT CCA GCA ATC ATG TCT GCA TCT CCA GGG GAG AAG GTC 1347 
Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly Glu Lys Val 
425 430 435 440 

ACC ATG ACC TGC AGT GCC AGC TCA AGT GTA AGT TAC ATG AAC TGG TAC 1395 
Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met Asn Trp Tyr 
445 450 455 



CAG CAG AAG TCA GGC ACC TCC CCC AAA AGA TGG ATT TAT GAC ACA TCC 1443 
Gln Gln Lys Ser Gly Thr Ser Pro Lys Arg Trp Ile Tyr Asp Thr Ser 
460 465 470 

AAA CTG GCT TCT GGA GTC CCT GCT CAC TTC AGG GGC AGT GGG TCT GGG 1491 
Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser Gly Ser Gly 
475 480 485 

ACC TCT TAC TCT CTC ACA ATC AGC GGC ATG GAG GCT GAA GAT GCT GCC 1539 
Thr Ser Tyr Ser Leu Thr Ile Ser Gly Met Glu Ala Glu Asp Ala Ala 
490 495 500 

ACT TAT TAC TGC CAG CAG TGG AGT AGT AAC CCA TTC ACG TTC GGC TCG 1587 
Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr Phe Gly Ser 
505 510 515 520 

GGG ACA AAG TTG GAA ATA AAC CGG GCT GAT ACT GCA CCA ACT GGA TCC 1635 
Gly Thr Lys Leu Glu Ile Asn Arg Ala Asp Thr Ala Pro Thr Gly Ser 
525 530 535 

GAA CAA AAG CTG ATC TCA GAA GAA GAC CTA AAC TCA CAT CAC CAT CAC 1683 
Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Ser His His His His 
540 545 550 

CAT CAC TAATCTAGA 1698 
His His 
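
As a consistency check on the feature table of SEQ ID NO: 1, the CDS coordinates (28..1689) can be converted into a codon count and compared with the 554 amino acids stated for SEQ ID NO: 2. The following is only a minimal sketch in Python; the function name `cds_codon_count` is illustrative and is not part of the listing or of any sequence-listing software.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons spanned by a CDS feature.

    start and end are 1-based, inclusive nucleotide positions,
    as written in a POSITION field of the listing (e.g. 28..1689).
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a positive multiple of 3")
    return length // 3


# Coordinates taken from the feature table of SEQ ID NO: 1.
codons = cds_codon_count(28, 1689)
print(codons)  # 554, the stated length of SEQ ID NO: 2
```

Under the same assumption (the stop codon lies outside the annotated CDS), the coordinates 28..1644 given for SEQ ID NO: 3 yield 539 codons, which agrees with the length stated for SEQ ID NO: 4.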



(2) INDICATIONS AS TO ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gln Pro Ala Met Ala Gln Val Gln Leu Gln Gln Ser Gly Ala Glu 
20 25 30 

Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly 
35 40 45 

Tyr Thr Phe Thr Arg Tyr Thr Met His Trp Val Lys Gln Arg Pro Gly 
50 55 60 

Gln Gly Leu Glu Trp Ile Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr 
65 70 75 80 

Asn Tyr Asn Gln Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys 
85 90 95 

Ser Ser Ser Thr Ala Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp 
100 105 110 

Ser Ala Val Tyr Tyr Cys Ala Arg Tyr Tyr Asp Asp His Tyr Ser Leu 
115 120 125 

Asp Tyr Trp Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr 
130 135 140 

Thr Pro Lys Leu Gly Gly Asp Ile Leu Leu Thr Gln Thr Pro Ala Ser 
145 150 155 160 

Leu Ala Val Ser Leu Gly Gln Arg Ala Thr Ile Ser Cys Lys Ala Ser 
165 170 175 

Gln Ser Val Asp Tyr Asp Gly Asp Ser Tyr Leu Asn Trp Tyr Gln Gln 
180 185 190 

Ile Pro Gly Gln Pro Pro Lys Leu Leu Ile Tyr Asp Ala Ser Asn Leu 
195 200 205 

Val Ser Gly Ile Pro Pro Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
210 215 220 

Phe Thr Leu Asn Ile His Pro Val Glu Lys Val Asp Ala Ala Thr Tyr 
225 230 235 240 

His Cys Gln Gln Ser Thr Glu Asp Pro Trp Thr Phe Gly Gly Gly Thr 
245 250 255 

Lys Leu Glu Ile Lys Arg Ala Asp Ala Ala Ala Ala Gly Gly Gly Gly 
260 265 270 

Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
275 280 285 

Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Val Arg Pro Gly Ser 
290 295 300 

Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ala Phe Ser Ser Tyr 
305 310 315 320 

Trp Met Asn Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile 
325 330 335 

Gly Gln Ile Trp Pro Gly Asp Gly Asp Thr Asn Tyr Asn Gly Lys Phe 
340 345 350 

Lys Gly Lys Ala Thr Leu Thr Ala Asp Glu Ser Ser Ser Thr Ala Tyr 
355 360 365 

Met Gln Leu Ser Ser Leu Ala Ser Glu Asp Ser Ala Val Tyr Phe Cys 
370 375 380 

Ala Arg Arg Glu Thr Thr Thr Val Gly Arg Tyr Tyr Tyr Ala Met Asp 
385 390 395 400 

Tyr Trp Gly Gln Gly Thr Ser Val Thr Val Ser Ser Ala Lys Thr Thr 
405 410 415 

Pro Lys Leu Gly Gly Asp Ile Val Leu Thr Gln Ser Pro Ala Ile Met 
420 425 430 

Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser 
435 440 445 

Ser Val Ser Tyr Met Asn Trp Tyr Gln Gln Lys Ser Gly Thr Ser Pro 
450 455 460 

Lys Arg Trp Ile Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala 
465 470 475 480 

His Phe Arg Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser 
485 490 495 

Gly Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser 
500 505 510 

Ser Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu Ile Asn Arg 
515 520 525 

Ala Asp Thr Ala Pro Thr Gly Ser Glu Gln Lys Leu Ile Ser Glu Glu 
530 535 540 

Asp Leu Asn Ser His His His His His His 
545 550 



(2) INDICATIONS AS TO ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 28..1644 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 28..1644 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GAATTCATTA AAGAGGAGAA ATTAACC ATG AAA TAC CTA TTG CCT ACG GCA 51 
Met Lys Tyr Leu Leu Pro Thr Ala 
1 5 

GCC GCT GGC TTG CTG CTG CTG GCA GCT CAG CCG GCC ATG GCG CAG GTG 99 
Ala Ala Gly Leu Leu Leu Leu Ala Ala Gln Pro Ala Met Ala Gln Val 
10 15 20 

CAA CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG 147 
Gln Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val 
25 30 35 40 

AAG ATG TCC TGC AAG GCT TCT GGC TAC ACC TTT ACT AGG TAC ACG ATG 195 
Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr Thr Met 
45 50 55 

CAC TGG GTA AAA CAG AGG CCT GGA CAG GGT CTG GAA TGG ATT GGA TAC 243 
His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile Gly Tyr 
60 65 70 

ATT AAT CCT AGC CGT GGT TAT ACT AAT TAC AAT CAG AAG TTC AAG GAC 291 
Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Phe Lys Asp 
75 80 85 

AAG GCC ACA TTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAA 339 
Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr Met Gln 
90 95 100 

CTG AGC AGC CTG ACA TCT GAG GAC TCT GCA GTC TAT TAC TGT GCA AGA 387 
Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 
105 110 115 120 

TAT TAT GAT GAT CAT TAC AGC CTT GAC TAC TGG GGC CAA GGC ACC ACT 435 
Tyr Tyr Asp Asp His Tyr Ser Leu Asp Tyr Trp Gly Gln Gly Thr Thr 
125 130 135 

CTC ACA GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 483 
Leu Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp Ile 
140 145 150 

TTG CTC ACC CAA ACT CCA GCT TCT TTG GCT GTG TCT CTA GGG CAG AGG 531 
Leu Leu Thr Gln Thr Pro Ala Ser Leu Ala Val Ser Leu Gly Gln Arg 
155 160 165 

GCC ACC ATC TCC TGC AAG GCC AGC CAA AGT GTT GAT TAT GAT GGT GAT 579 
Ala Thr Ile Ser Cys Lys Ala Ser Gln Ser Val Asp Tyr Asp Gly Asp 
170 175 180 

AGT TAT TTG AAC TGG TAC CAA CAG ATT CCA GGA CAG CCA CCC AAA CTC 627 
Ser Tyr Leu Asn Trp Tyr Gln Gln Ile Pro Gly Gln Pro Pro Lys Leu 
185 190 195 200 

CTC ATC TAT GAT GCA TCC AAT CTA GTT TCT GGG ATC CCA CCC AGG TTT 675 
Leu Ile Tyr Asp Ala Ser Asn Leu Val Ser Gly Ile Pro Pro Arg Phe 
205 210 215 

AGT GGC AGT GGG TCT GGG ACA GAC TTC ACC CTC AAC ATC CAT CCT GTG 723 
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Asn Ile His Pro Val 
220 225 230 

GAG AAG GTG GAT GCT GCA ACC TAT CAC TGT CAG CAA AGT ACT GAG GAT 771 
Glu Lys Val Asp Ala Ala Thr Tyr His Cys Gln Gln Ser Thr Glu Asp 
235 240 245 

CCG TGG ACG TTC GGT GGA GGC ACC AAG CTG GAA ATC AAA CGG GCT GAT 819 
Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys Arg Ala Asp 
250 255 260 

GCT GCG GCC GCT GGT GGC CCA GGG TCG CAG GTG CAG CTG CAG CAG TCT 867 
Ala Ala Ala Ala Gly Gly Pro Gly Ser Gln Val Gln Leu Gln Gln Ser 
265 270 275 280 

GGG GCT GAG CTG GTG AGG CCT GGG TCC TCA GTG AAG ATT TCC TGC AAG 915 
Gly Ala Glu Leu Val Arg Pro Gly Ser Ser Val Lys Ile Ser Cys Lys 
285 290 295 

GCT TCT GGC TAT GCA TTC AGT AGC TAC TGG ATG AAC TGG GTG AAG CAG 963 
Ala Ser Gly Tyr Ala Phe Ser Ser Tyr Trp Met Asn Trp Val Lys Gln 
300 305 310 

AGG CCT GGA CAG GGT CTT GAG TGG ATT GGA CAG ATT TGG CCT GGA GAT 1011 
Arg Pro Gly Gln Gly Leu Glu Trp Ile Gly Gln Ile Trp Pro Gly Asp 
315 320 325 

GGT GAT ACT AAC TAC AAT GGA AAG TTC AAG GGT AAA GCC ACT CTG ACT 1059 
Gly Asp Thr Asn Tyr Asn Gly Lys Phe Lys Gly Lys Ala Thr Leu Thr 
330 335 340 

GCA GAC GAA TCC TCC AGC ACA GCC TAC ATG CAA CTC AGC AGC CTA GCA 1107 
Ala Asp Glu Ser Ser Ser Thr Ala Tyr Met Gln Leu Ser Ser Leu Ala 
345 350 355 360 

TCT GAG GAC TCT GCG GTC TAT TTC TGT GCA AGA CGG GAG ACT ACG ACG 1155 
Ser Glu Asp Ser Ala Val Tyr Phe Cys Ala Arg Arg Glu Thr Thr Thr 
365 370 375 

GTA GGC CGT TAT TAC TAT GCT ATG GAC TAC TGG GGT CAA GGA ACC TCA 1203 
Val Gly Arg Tyr Tyr Tyr Ala Met Asp Tyr Trp Gly Gln Gly Thr Ser 
380 385 390 

GTC ACC GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 1251 
Val Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp Ile 
395 400 405 



GTG CTC ACT CAG TCT CCA GCA ATC ATG TCT GCA TCT CCA GGG GAG AAG 1299 
Val Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly Glu Lys 
410 415 420 

GTC ACC ATG ACC TGC AGT GCC AGC TCA AGT GTA AGT TAC ATG AAC TGG 1347 
Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met Asn Trp 
425 430 435 440 

TAC CAG CAG AAG TCA GGC ACC TCC CCC AAA AGA TGG ATT TAT GAC ACA 1395 
Tyr Gln Gln Lys Ser Gly Thr Ser Pro Lys Arg Trp Ile Tyr Asp Thr 
445 450 455 

TCC AAA CTG GCT TCT GGA GTC CCT GCT CAC TTC AGG GGC AGT GGG TCT 1443 
Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser Gly Ser 
460 465 470 

GGG ACC TCT TAC TCT CTC ACA ATC AGC GGC ATG GAG GCT GAA GAT GCT 1491 
Gly Thr Ser Tyr Ser Leu Thr Ile Ser Gly Met Glu Ala Glu Asp Ala 
475 480 485 

GCC ACT TAT TAC TGC CAG CAG TGG AGT AGT AAC CCA TTC ACG TTC GGC 1539 
Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr Phe Gly 
490 495 500 

TCG GGG ACA AAG TTG GAA ATA AAC CGG GCT GAT ACT GCA CCA ACT GGA 1587 
Ser Gly Thr Lys Leu Glu Ile Asn Arg Ala Asp Thr Ala Pro Thr Gly 
505 510 515 520 

TCC GAA CAA AAG CTG ATC TCA GAA GAA GAC CTA AAC TCA CAT CAC CAT 1635 
Ser Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Ser His His His 
525 530 535 

CAC CAT CAC TAATCTAGA 1653 
His His His 
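
The 57-nucleotide primer listed as SEQ ID NO: 5 appears to anneal to the shortened linker region of SEQ ID NO: 3. A minimal sketch of how this can be checked is given below, assuming that the first six bases of the primer (TATATA) are a non-annealing 5' overhang and that the target is nucleotides 811..861 of SEQ ID NO: 3 as read from the listing above; both assumptions, and the helper name `reverse_complement`, are illustrative and are not stated in the listing.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# SEQ ID NO: 5 as listed (57 nt), spaces removed.
primer = "TATATACTGCAGCTGCACCTGCGACCCTGGGCCACCAGCGGCCGCAGCATCAGCCCG"

# Assumed target: nt 811..861 of SEQ ID NO: 3 (51 nt),
# i.e. CGG GCT GAT ... CAG CTG CAG across the Gly-Gly-Pro-Gly-Ser linker.
template = "CGGGCTGATGCTGCGGCCGCTGGTGGCCCAGGGTCGCAGGTGCAGCTGCAG"

# Drop the assumed 6-nt 5' overhang before comparing.
annealing_part = primer[6:]
print(reverse_complement(annealing_part) == template)
```

If the comparison holds, the primer is an antisense primer covering the end of the first VL domain, the linker, and the start of the second VH domain.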



(2) INDICATIONS AS TO ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gln Pro Ala Met Ala Gln Val Gln Leu Gln Gln Ser Gly Ala Glu 
20 25 30 

Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly 
35 40 45 

Tyr Thr Phe Thr Arg Tyr Thr Met His Trp Val Lys Gln Arg Pro Gly 
50 55 60 

Gln Gly Leu Glu Trp Ile Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr 
65 70 75 80 

Asn Tyr Asn Gln Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys 
85 90 95 

Ser Ser Ser Thr Ala Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp 
100 105 110 

Ser Ala Val Tyr Tyr Cys Ala Arg Tyr Tyr Asp Asp His Tyr Ser Leu 
115 120 125 

Asp Tyr Trp Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr 
130 135 140 

Thr Pro Lys Leu Gly Gly Asp Ile Leu Leu Thr Gln Thr Pro Ala Ser 
145 150 155 160 

Leu Ala Val Ser Leu Gly Gln Arg Ala Thr Ile Ser Cys Lys Ala Ser 
165 170 175 

Gln Ser Val Asp Tyr Asp Gly Asp Ser Tyr Leu Asn Trp Tyr Gln Gln 
180 185 190 

Ile Pro Gly Gln Pro Pro Lys Leu Leu Ile Tyr Asp Ala Ser Asn Leu 
195 200 205 

Val Ser Gly Ile Pro Pro Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
210 215 220 

Phe Thr Leu Asn Ile His Pro Val Glu Lys Val Asp Ala Ala Thr Tyr 
225 230 235 240 

His Cys Gln Gln Ser Thr Glu Asp Pro Trp Thr Phe Gly Gly Gly Thr 
245 250 255 

Lys Leu Glu Ile Lys Arg Ala Asp Ala Ala Ala Ala Gly Gly Pro Gly 
260 265 270 

Ser Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Val Arg Pro Gly 
275 280 285 

Ser Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ala Phe Ser Ser 
290 295 300 

Tyr Trp Met Asn Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp 
305 310 315 320 

Ile Gly Gln Ile Trp Pro Gly Asp Gly Asp Thr Asn Tyr Asn Gly Lys 
325 330 335 

Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Glu Ser Ser Ser Thr Ala 
340 345 350 

Tyr Met Gln Leu Ser Ser Leu Ala Ser Glu Asp Ser Ala Val Tyr Phe 
355 360 365 

Cys Ala Arg Arg Glu Thr Thr Thr Val Gly Arg Tyr Tyr Tyr Ala Met 
370 375 380 

Asp Tyr Trp Gly Gln Gly Thr Ser Val Thr Val Ser Ser Ala Lys Thr 
385 390 395 400 

Thr Pro Lys Leu Gly Gly Asp Ile Val Leu Thr Gln Ser Pro Ala Ile 
405 410 415 

Met Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala Ser 
420 425 430 

Ser Ser Val Ser Tyr Met Asn Trp Tyr Gln Gln Lys Ser Gly Thr Ser 
435 440 445 

Pro Lys Arg Trp Ile Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro 
450 455 460 

Ala His Phe Arg Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile 
465 470 475 480 

Ser Gly Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp 
485 490 495 

Ser Ser Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu Ile Asn 
500 505 510 

Arg Ala Asp Thr Ala Pro Thr Gly Ser Glu Gln Lys Leu Ile Ser Glu 
515 520 525 

Glu Asp Leu Asn Ser His His His His His His 
530 535 



(2) INDICATIONS AS TO ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TATATACTGC AGCTGCACCT GCGACCCTGG GCCACCAGCG GCCGCAGCAT CAGCCCG 57 

(2) INDICATIONS AS TO ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 



(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCGTGAATTC CAGGTGCAAC TGCAGCAGTC TGGGGCTGAA CTGGC 45 



(2) INDICATIONS AS TO ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GGTCGACGTT AACCGACAAA CAACAGATAA AACG 34 

(2) INDICATIONS AS TO ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..348 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 1..348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG AGA TTT CCT TCA ATT TTT ACT GCT GTT TTA TTC GCA GCA TCC TCC 
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1 5 10 15 

GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln 
20 25 30 

ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA GAT TTA GAA GGG GAT TTC 
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

TTT ATA AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA 
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

TCT CTC GAG AAA AGA GAG GCT GAA GCT GAA TTC CAG GTG CAA CTG CAG 
Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Gln Val Gln Leu Gln 
85 90 95 

CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG AAG ATG TCC 
Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser 
100 105 110 

TGC AAG GCT TCT 
Cys Lys Ala Ser 
115 

(2) INDICATIONS AS TO ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1 5 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln 
20 25 30 

Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Gln Val Gln Leu Gln 
85 90 95 

Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser 
100 105 110 

Cys Lys Ala Ser 
115 



(2) INDICATIONS AS TO ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 



(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..354 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG AGA TTT CCT TCA ATT TTT ACT GCT GTT TTA TTC GCA GCA TCC TCC 
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1 5 10 15 

GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln 
20 25 30 

ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA GAT TTA GAA GGG GAT TTC 
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

TTT ATA AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA 
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

TCT CTC GAG AAA AGA GAG GCT GAA GCT GAA TTC ATG GCG CAG GTG CAA 
Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Met Ala Gln Val Gln 
85 90 95 

CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG AAG 
Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys 
100 105 110 

ATG TCC TGC AAG GCT TCT 
Met Ser Cys Lys Ala Ser 
115 



(2) INDICATIONS AS TO ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1 5 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln 
20 25 30 

Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Met Ala Gln Val Gln 
85 90 95 

Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys 
100 105 110 

Met Ser Cys Lys Ala Ser 
115 



(2) INDICATIONS AS TO ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TCACACAGAA TTCTTAGATC TATTAAAGAG GAGAAATTAA CC 



(2) INDICATIONS AS TO ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGCACACGAT ATCACCGCCA AGCTTGGGTG TTGTTTTGGC 



(2) INDICATIONS AS TO ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



AGCACACAAG CTTGGCGGTG ATATCTTGCT CACCCAAACT CCA 

(2) INDICATIONS AS TO ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AGCACACTCT AGAGACACAC AGATCTTTAG TGATGGTGAT GGTGATGTGA GTTTAGG 



(2) INDICATIONS AS TO ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CAGCCGGCCA TGGCGCAGGT GCAACTGCAG CAG 33 

(2) INDICATIONS AS TO ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TATATACTGC AGCTGCACCT GGCTACCACC ACCACCGGAG CCGCCACCAC CGCTACCACC 60 
GCCGCCAGAA CCACCACCAC CAGCGGCCGC AGCATCAGCC CG 102 